DAP: LSTM-CRF Auto-encoder

نویسندگان

  • Yuan Liu
  • Matthew R. Gormley
چکیده

The LSTM-CRF is a hybrid graphical model which achieves state-of-the-art performance in supervised sequence labeling tasks. Collecting labeled data consumes lots of human resources and time. Thus, we want to improve the performance of LSTM-CRF by semi-supervised learning. Typically, people use pre-trained word representation to initialize models embedding layer from unlabeled data. However, these word representations are modelagnostic, which means they were trained without knowledge of the LSTM-CRFs structure. Thus, we introduce auto-encoder training for the LSTM-CRF to tune the models parameters by adding a decoder after the CRF. We compare the performance of these two methods and find that while the auto-encoder can improve performance in some situation, the model-agnostic approach achieves more universal improvement. We also how to use minibatches in LSTM-CRF.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Human Centred Object Co-Segmentation

Co-segmentation is the automatic extraction of the common semantic regions given a set of images. Different from previous approaches mainly based on object visuals, in this paper, we propose a human centred object co-segmentation approach, which uses the human as another strong evidence. In order to discover the rich internal structure of the objects reflecting their human-object interactions a...

متن کامل

Bidirectional LSTM-CRF Models for Sequence Tagging

In this paper, we propose a variety of Long Short-Term Memory (LSTM) based models for sequence tagging. These models include LSTM networks, bidirectional LSTM (BI-LSTM) networks, LSTM with a Conditional Random Field (CRF) layer (LSTM-CRF) and bidirectional LSTM with a CRF layer (BI-LSTM-CRF). Our work is the first to apply a bidirectional LSTM CRF (denoted as BI-LSTM-CRF) model to NLP benchmark...

متن کامل

Using Stacked Sparse Auto-Encoder and Superpixel CRF for Long-Term Visual Scene Understanding of UGVs

Multiple images have been widely used for scene understanding and navigation of unmanned ground vehicles in long term operations. However, as the amount of visual data in multiple images is huge, the cumulative error in many cases becomes untenable. This paper proposes a novel method that can extract features from a large dataset of multiple images efficiently. Then the membership K-means clust...

متن کامل

An Empirical Study on Energy Disaggregation via Deep Learning

Energy disaggregation is the task of estimating power consumption of each individual appliance from the whole-house electric signals. In this paper, we study this task based on deep learning methods which have achieved a lot of success in various domains recently. We introduce the feature extraction method that uses multiple parallel convolutional layers of varying filter sizes and present an L...

متن کامل

Diversity encouraged learning of unsupervised LSTM ensemble for neural activity video prediction

Being able to predict the neural signal in the near future from the current and previous observations has the potential to enable real-time responsive brain stimulation to suppress seizures. We have investigated how to use an auto-encoder model consisting of LSTM cells for such prediction. Recognizing that there exist multiple activity pattern clusters, we have further explored to train an ense...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017